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RAW SEQUENCE LISTING DATE: 11/27/2001 

PATENT APPLICATION: US/09/837,138 TIME: 10:44:23 

Input Set : N :\Crf 3\RULE60\09837138 • raw 
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1 <110> APPLICANT: Petrini, John H.J. 

2 Morgan, William Franklin 

3 Maser, Richard Scott 

4 Carney, James Patrick 

5 <120> TITLE OF INVENTION: DNA Encoding A DNA Repair Protein 

6 <130> FILE REFERENCE: 800.019US1 

7 <140> CURRENT APPLICATION NUMBER: 09/837,138 

8 <141> CURRENT FILING DATE: 2001-04-18 

9 <150> PRIOR APPLICATION NUMBER: 09/067,641 

10 <151> PRIOR FILING DATE: 1999-12-03 

11 <160> NUMBER OF SEQ ID NOS : 24 

12 <170> SOFTWARE: FastSEQ for Windows Version 3.0 

14 <210> SEQ. ID NO: 1 

15 <211> LENGTH: 4403 

16 <212> TYPE: DNA 

17 <213> ORGANISM: Homo sapiens 

18 <400> SEQUENCE: 1 

19 ttcggcacga ggcgcggttg cacgtcggcc ccagccctga ggagccggac cgatgtggaa 60 

20 actgctgccc gccgcgggcc cggcaggagg agaaccatac agacttttga ctggcgttga 120 

21 gtacgttgtt ggaaggaaaa actgtgccat tctaattgaa aatgatcagt cgatcagccg 180 

22 aaatcatgct gtgttaactg ctaacttttc tgtaaccaac ctgagtcaaa cagatgaaat 240 

23 ccctgtattg acattaaaag ataattctaa gtatggtacc tttgttaatg aggaaaaaat 300 

24 gcagaatggc ttttcccgaa ctttgaagtc gggggatggt attacttttg gagtgtttgg 360 

25 aagtaaattc agaatagagt atgagccttt ggttgcatgc tcttcttgtt tagatgtctc 420 

26 tgggaaaact gctttaaatc aagctatatt gcaacttgga ggatttactg taaacaattg 480 

27 gacagaagaa tgcactcacc ttgtcatggt atcagtgaaa gttaccatta aaacaatatg 540 

28 tgcactcatt tgtggacgtc caattgtaaa gccagaatat tttactgaat tcctgaaagc 600 

29 agttcagtcc aagaagcagc ctccacaaat tgaaagtttt tacccacctc ttgatgaacc 660 

30 atctattgga agtaaaaatg ttgatctgtc aggacggcag gaaagaaaac aaatcttcaa 720 

31 agggaaaaca tttatatttt tgaatgccaa acagcataag aaattgagtt ccgcagttgt 780 

32 ctttggaggt ggggaagcta ggttgataac agaagagaat gaagaagaac ataatttctt 84 0 

33 tttggctccg ggaacgtgtg ttgttgatac aggaataaca aactcacaga ccttaattcc 900 

34 tgactgtcag aagaaatgga ttcagtcaat aatggatatg ctccaaaggc aaggtcttag 960 

35 acctattcct gaagcagaaa ttggattggc ggtgattttc atgactacaa agaattactg 1020 

36 tgatcctcag ggccatccca gtacaggatt aaagacaaca actccaggac caagcctttc 1080 

37 acaaggcgtg tcagttgatg aaaaactaat gccaagcgcc ccagtgaaca ctacaacata 1140 

38 cgtagctgac acagaatcag agcaagcaga tacatgggat ttgagtgaaa ggccaaaaga 1200 

39 aatcaaagtc tccaaaatgg aacaaaaatt cagaatgctt tcacaagacg cacccactgt 1260 

40 aaaggagtcc tgcaaaacaa gctctaataa taatagtatg gtatcaaata ctttggctaa 1320 

41 gatgagaatc ccaaactatc agctttcacc aactaaattg ccaagtataa ataaaagtaa 1380 

42 agatagggct tctcagcagc agcagaccaa ctccatcaga aactactttc agccgtctac 1440 

43 \ caaaaaaagg gaaagggatg aagaaaatca agaaatgtct tcatgcaaat cagcaagaat 1500 

44 ""agaaacgtct tgttctcttt tagaacaaac acaacctgct acaccctcat tgtggaaaaa 1560 

45 taaggagcag catctatctg agaatgagcc tgtggacaca aactcagaca ataacttatt 1620 

46 tacagataca gatttaaaat ctattgtgaa aaattctgcc agtaaatctc atgctgcaga 1680 

47 aaagctaaga tcaaataaaa aaagggaaat ggatgatgtg gccatagaag atgaagtatt 1740 

48 ggaacagtta ttcaaggaca caaaaccaga gttagaaatt gatgtgaaag ttcaaaaaca 1800 
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49 ggaggaagat gtcaatgtta gaaaaaggcc aaggatggat atagaaacaa atgacacttt 1860 

50 cagtgatgaa gcagtaccag aaagtagcaa aatatctcaa gaaaatgaaa ttgggaagaa 1920 

51 acgtgaactc aaggaagact cactatggtc agctaaagaa atatctaaca atgacaaact 1980 

52 tcaggatgat agtgagatgc ttccaaaaaa gctgttattg actgaattta gatcactggt 2040 

53 gattaaaaac tctacttcca gaaatccgtc tggcataaat gatgattatg gtcaactaaa 2100 

54 aaatttcaag aaattcaaaa aggtcacata tcctggagca ggaaaacttc cacacatcat 2160 

55 tggaggatca gatctaatag ctcatcatgc tcgaaagaat acagaactag aagagtggct 2220 

56 aaggcaggaa atggaggtac aaaatcaaca tgcaaaagaa gagtctcttg ctgatgatct 2280 

57 ttttagatac aatccttatt taaaaaggag aagataactg aggattttaa aaagaagcca 2340 

58 tggaaaaact tcctagtaag catctacttc aggccaacaa ggttatatga atatatagtg 2400 

59 tatagaagcg atttaagtta caatgtttta tggcctaaat ttattaaata aaatgcacaa 24 60 

60 aactttgatt cttttgtatg taacaattgt ttgtyctgtt ttcaggcttt gtcattgcat 2520 

61 ctttttttca tttttaaatg tgttttgttt attaaatagt taatatagtc acagttcaaa 2580 

62 attctaaatr tacgtaaggt aaaggactaa agtcaccctt ccaccattgt cctagctact 2640 

63 tatttttaaa taatttccta cacaaatgat agcataacat atgcagtgtt ctacaccttg 2700 

64 cttttttact tagtaagatt aaaaattata ggaatatcaa tataatgttt ttaatatttt 2760 

65 ttcttttcca ttatgctgta gtcttaccta aactctggtg atccaaacaa aatggcttca 2820 

66 gtggtgcaga tgtcacctac atgttattct agtactagaa actgaagacc atgtggagac 2880 

67 ttcatcaaac atgggtttag ttttcaccag aatggaaaga cctgtacccc tttttggtgg 2940 

68 tcttactgag ctgggtgggt gtctgttttg agcttattta gagtcctagt tttcctactt 3000 

69 ataaagtaga aatggtgaga ttgttttctt tttctacckt aaagggagat ggtaagaaac 3060 

70 aatgaatgtc ttttttcaaa ctttattgac aagtgatttt caagtctgtg ttcaaaaata 3120 

71 tattcatgta cctgtgatcc agcaagaagg gagttccagt caagagtcac tacaactgat 3180 

72 tagttgttta gagaatgaga aatggaacag tgaggaatgg aggccatatt tccatgactt 3240 

73 cccttgtaaa cagaagcaac agaagggaca agaggctggc ctctacatca ctctcacctt 3300 

74 ccaaatcttg tggaagtgca tctacttgcc agaaccaaat taacttactt ccaagttctg 3360 

75 gctgcttgca ggtggaactc cagctgcaag ggagttaggg aaatgaaggt ctttttttaa 3420 

76 aagcttctca gccttcctag ggaacagaaa ttgggtgagc caatctgcaa tttctactac 3480 

77 aggcattgag accagttaga ttattgaaat attatagaga gttatgaaca cttaaattat 3540 

78 gatagtggta tgacattgga tagaacatgg gatactttag aagtagaatt gacagggcat 3600 

79 attagttgat gaaatggagt catttgagtc tyttaatagc catgtatcat aattaccaag 3660 

80 tgaagctggt ggaacatatg gtctccattt tacagttaag gaatataatg gacagattaa 3720 

81 tattgttytc tgtcatgccc acaatccctt tctaaggaag actgccctac tatagcagtt 3780 

82 tttatatttg tcaatttatg aatataatga atgaggagtt ctggtacctc ctgtctttac 3840 

83 aaatattggg tgttgtccag tatttttccc tttttaaccm ttcccaattc gggtgtgtag 3900 

84 gtggatgttt ccatttgggt tttaatttgt atatccctga tagctataat tgggtcatag 3960 

85 aaattcttta tacattctag atgcaagtct cttgycggat atacgtattg agatattaca 4020 

86 cctagtctgt ggcttgactg ttttctttat gtcttttgat gaatagaagt tttaaatttt 4080 

87 gacaaggtca aatttatttt tttcttttgt ttgatatttt ttctctccaa tttaacccca 4140 

88 agatttcaga tattctgctc tattatataa actttatatt tttatatttg tgatctacct 4200 

89 tgaattgata tgtatgttgt gaattatgga tcagggttct ttttttcccc catacaagta 4260 

90 tccagtcatt gtaacactgt ttattgaaag aattatcctt tcctcattaa attaccttgc 4320 

91 caattagtaa aaaatcaatt aaccatrmar mmmrrrggat ccactagttc tagagcggcc 4380 

92 gccaccgcgg tggagctcca get 4403 



94 <210> SEQ ID NO: 2 

95 <211> LENGTH: 754 

96 <212> TYPE: PRT 

97 <213> ORGANISM: Homo sapiens 

98 <4 00> SEQUENCE: 2 
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148 385 390 395 400 

149 Pro Thr Val Lys Glu Ser Cys Lys Thr Ser Ser Asn Asn Asn Ser Met 

150 405 410 415 

151 Val Ser Asn Thr Leu Ala Lys Met Arg lie Pro Asn Tyr Gin Leu Ser 

152 420 425 430 

153 Pro Thr Lys Leu Pro Ser lie Asn Lys Ser Lys Asp Arg Ala Ser Gin 

154 435 440 445 

155 Gin Gin Gin Thr Asn Ser lie Arg Asn Tyr Phe Gin Pro Ser Thr Lys 

156 450 455 460 

157 Lys Arg Glu Arg Asp Glu Glu Asn Gin Glu Met Ser Ser Cys Lys Ser 

158 465 470 475 480 

159 Ala Arg lie Glu Thr Ser Cys Ser Leu Leu Glu Gin Thr Gin Pro Ala 

160 485 490 495 

161 Thr Pro Ser Leu Trp Lys Asn Lys Glu Gin His Leu Ser Glu Asn Glu 

162 500 505 510 

163 Pro Val Asp Thr Asn Ser Asp Asn Asn Leu Phe Thr Asp Thr Asp Leu 

164 515 520 525 

165 Lys Ser lie Val Lys Asn Ser "Ala Ser Lys Ser His Ala Ala Glu Lys 

166 530 535 > 540 

167 Leu Arg Ser Asn Lys Lys Arg Glu Met Asp Asp Val Ala lie Glu Asp 

168 545 550 555 560 

169 Glu Val Leu Glu Gin Leu Phe Lys Asp Thr Lys Pro Glu Leu Glu lie 

170 565 570 575 

171 Asp Val Lys Val Gin Lys Gin Glu Glu Asp Val Asn Val Arg Lys Arg 

172 580 585 590 

173 Pro Arg Met Asp lie Glu Thr Asn Asp Thr Phe Ser Asp Glu Ala Val 

174 595 600 605 

175 Pro Glu Ser Ser Lys lie Ser Gin Glu Asn Glu lie Gly Lys Lys Arg 

176 610 615 620 

177 Glu Leu Lys Glu Asp Ser Leu Trp Ser Ala Lys Glu lie Ser Asn Asn 

178 625 630 635 640 

179 Asp Lys Leu Gin Asp Asp Ser Glu Met Leu Pro Lys Lys Leu Leu Leu 

180 645 650 655 

181 Thr Glu Phe Arg Ser Leu Val lie Lys Asn Ser Thr Ser Arg Asn Pro 

182 660 665 670 

183 Ser Gly lie Asn Asp Asp Tyr Gly Gin Leu Lys Asn Phe Lys Lys Phe 

184 675 680 685 

185 Lys Lys Val Thr Tyr Pro Gly Ala Gly Lys Leu Pro His lie lie Gly 

186 690 695 700 

187 Gly Ser Asp Leu lie Ala His His Ala Arg Lys Asn Thr Glu Leu Glu 

188 705 710 715 720 

189 Glu Trp Leu Arg Gin Glu Met Glu Val Gin Asn Gin His Ala Lys Glu 

190 725 730 735 

191 Glu Ser Leu Ala Asp Asp Leu Phe Arg Tyr Asn Pro Tyr Leu Lys Arg 

192 740 745 750 

193 Arg Arg 



195 <210> SEQ ID NO: 3 

196 <211> LENGTH: 87 

197 <212> TYPE: PRT 
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198 <213> ORGANISM: Homo sapiens 

199 <220> FEATURE: 

200 <221> NAME/KEY: UNSURE 

201 <222> LOCATION: (48)... (48) 

202 <223> OTHER INFORMATION: Unsure 

203 <4 00> SEQUENCE: 3 



204 Tyr Val Val Gly Arg Lys Asn Cys Ala lie Leu lie Glu Asn Asp Gin 

205 1 5 10 15 

206 Ser lie Ser Arg Asn His Ala Val Leu Thr Ala Asn Phe Ser Val Thr 

207 20 25 30 

W--> 208 Asn Leu Ser Gin Thr Asp Glu lie Pro Val Leu Thr Leu Lys Asn Xaa 

209 35 40 -45 

210 Lys Tyr Gly Thr Phe Val Asn Glu Glu Lys Met Gin Asn Gly Phe Ser 

211 50 55 60 

212 Arg Thr Leu Lys Ser Val Asp Gly lie Thr Phe Gly Val Phe Gly Ser 

213 65 70 75 80 

214 Lys Phe Arg lie Glu Tyr Glu 

215 85 



217 <210> SEQ ID NO: 4 

218 <211> LENGTH: 87 

219 <212> TYPE: PRT 

220 <213> ORGANISM: Homo sapiens 

221 <400> SEQUENCE: 4 



222 Tyr Ser lie Gly Arg Ser Ser Lys Asn Pro Leu lie lie Lys Asn Asp 

223 15 10 15 

224 Lys Ser lie Ser Arg Gin His lie Thr Phe Lys Trp Glu lie Asn Asn 

225 20 25 30 

226 Ser Ser Asp Leu Lys His Ser Ser Leu Cys Leu Val Asn Lys Gly Lys 

227 35 40 45 

228 Leu Thr Ser Leu Asn Lys Lys Phe Met Lys Val Gly Glu Thr Phe Thr 

229 50 55 60 

2 30 lie Asn Ala Ser Cys Val Leu Lys Ser Thr lie Glu Leu Gly Thr Thr 

231 65 70 75 80 

232 Pro He Arg He Glu Phe Glu 

233 85 



235 <210> SEQ ID NO: 5 

236 <211> LENGTH: 13 

237 <212> TYPE: PRT 

238 <213> ORGANISM: Homo sapiens 

239 <400> SEQUENCE: 5 

240 Asn Pro Ser Gly Leu Asn Asp Asp Tyr Gly Gin- Leu Lys 

241 15 10 

243 <210> SEQ ID NO: 6 

244 <211> LENGTH: 680 
24 5 <212> TYPE: PRT 

246 <213> ORGANISM: Homo sapiens 

247 <400> SEQUENCE: 6 

24 8 Met Ser Thr Ala Asp Ala Leu Asp Asp Glu Asn Thr Phe Lys He Leu 

249 15 10 15 
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